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ABSTRACT 

There  have  been  many  systems  developed  for  computer  process- 
ing of  natural  languages  such  as  English.     One  of  these,   known  as  NLP, 
is  being  developed  at  the  Naval  Postgraduate  School.     Another  system, 
based  on  Conceptual  Dependency  theory,   is  being  developed  at  Stanford 
University.     The  two  systems,   while  having  somewhat  similar  goals, 
use  different  internal  representations  of  information. 

The  purpose  of  this  thesis  was  to  devise  a  means  for  representing 
the  structures  of  Conceptual  Dependency  theory  in  NLP,   and  to  develop 
methods  for  conversion  of  information  between  NLP's  existing  repre- 
sentation and  that  of  Conceptual  Dependency  theory. 
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I.     INTRODUCTION 

There  has  been  much  research  devoted  to  the  development  of  lan- 
guage processors  that  will  allow  users  to  communicate  with  computers 
in  a  natural  language  such  as  English.     A  number  of  efforts  in  this  area 
are  reported  in  Refs.   5,    6,   and  7. 

Many  current  researchers  feel  that  the  key  to  developing  a  useful 
natural  language  processor  lies  in  having  a  "cognitive"  theory  of  lan- 
guage,  one  which  attempts  to  model  the  language  behavior  of  human 
beings.     One  such  theory  is  that  of  stratificational  linguistics  developed 
by  Lamb  [2]. 

This  report  deals  with  two  natural  language  processing  systems. 
Although  both  are  being  developed  generally  within  the  framework  of 
stratificational  linguistics,  they  use  different  representations  for  infor- 
mation found  in  natural  language  text.     The  first  of  these,    Natural 
Language  Processor  (NLP),   is  being  developed  at  the  Naval  Postgrad- 
uate School  by  G.   E.   Heidorn.     It  is  intended  to  be  a  general  natural 
language  processing  system,   and  it  uses  an  entity -attribute -value  form 
of  information-structure.     A  specific  application- -that  of  producing  a 
GPSS  program  from  an  English  description  of  a  queuing  problem r-is 
being  utilized  as  a  vehicle  for  its  development.     The  other  system, 
being  developed  at  Stanford  University  by  R .   C.   Schank,    is  called 
Conceptual  Dependency  theory,   and  defines  a  structure  using 
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"dependency  relations"  between  concepts  to  represent  the  meanings  of 
natural  language  text. 

The  objective  of  the  research  being  reported  on  in  this  thesis 
was  to  develop,  within  the  framework  of  NLP,    a  representation  for  the 
dependency  structures  of  Conceptual  Dependency  theory  and  also  to 
develop  procedures  for  converting  information  between  that  represen- 
tation and  the  existing  NLP  representation. 

The  remainder  of  the  thesis  is  divided  into  four  sections. 
Section  II  discusses  Conceptual  Dependency  Theory,   and  Section  III 
discusses  NLP.     Section  IV  describes  the  structure  and  rules  developed 
in  this  research.     Finally,   Section  V  presents  conclusions  and  recom- 
mendations. 


II.  CONCEPTUAL  DEPENDENCY  THEORY 

This  section  provides  an  introduction  to  Conceptual  Dependency 
Theory.     It  should  be  noted  that,   since  the  development  of  this  theory 
is  an  on-going  project,   the  concepts  and  rules  presented  in  this  section 
are  subject  to  modification  as  Schank's  work  progresses.     The  material 
for  this  section  was  taken  primarily  from  Refs.   3  and  4.     Some  por- 
tions,  notably  definitions,  were  taken  directly  from  the  reference 
material. 

A.         GENERAL  DISCUSSION 

Most  parsers  used  in  "conversation"  machines  have  been  syn- 
tactic parsers.     That  is,   they  analyze  an  input  sentence  and  construct 
a  syntactic  (grammatical)  network  from  it.     But  consider  the  sentence, 
"I  hit  the  boy  with  the  girl  with  long  hair  with  a  hammer  with  venge- 
ance. "    Clearly  the  syntactic  structure  of  this  sentence  will  not  provide 
the  information  necessary  to  get  at  the  meaning  of  it.     For  example,   if 
we  need  to  know  that  it  was  the  hammer  that  hit  the  boy,   we  would  have 
to  use  methods  more  sophisticated  than  syntactic  analysis. 

Computer  programs  that  process  natural  language  do  so  for  some 
purpose.     They  need  to  make  use  of  the  information  provided  by  a  sen- 
tence so  that  they  can  respond  properly.     T3ut  whatever  the  purpose,   it 
is  the  moaning  of  the  input  sentence  that  is  needed,   not  its  syntactic 
structure.     A  major  goal  of  this  work  is  tin;  at  tempi  to  analyze  natural 


language  into  meaning  (conceptual)  structures  that  are  unambiguous 
representations  of  the  meaning  of  an  input  utterance.     That  is,   any  two 
utterances  that  can  be  said  to  mean  the  same  thing,  whether  they  are 
in  the  same  or  different  languages,    should  be  characterized  in  only 
one  way  by  the  conceptual  structures.     The  representation  of  the  con- 
tent of  an  utterance  then,  must  be  in  interlingual  terms  that  are  as 
neutral  as  possible.     The  conceptual  base  is  responsible  for  formally 
representing  the  concepts  underlying  an  utterance  without  respect  to 
the  language  in  which  that  utterance  was  encoded.     The  concept(s)  that 
a  given  word  may  have  must  be  found  and  related  in  some  way  to  those 
concepts  denoted  by  other  words  in  a  given  utterance. 

There  are  two  distinct  levels  of  analysis  here  that  are  part  of  a 
stratified  (many-leveled)  system.     On  the  sentential  level,  the  utter- 
ances of  a  given  language  are  encoded  within  the  syntactic  structure  of 
that  language.     The  basic  construction  of  the  sentential  level  is  the 
sentence.     The  next  higher  level  in  the  system  is  the  conceptual  level, 
and  the  basic  construction  of  this  level  is  the  conceptualization.     A 
conceptualization  consists  of  concepts  and  certain  relations  that  exist 
between  these  concepts.     Underlying  every  sentence  in  a  language  there 
exists  at  least  one  conceptualization. 

B.         CONCEPTS 

The  basic  unit  of  the  conceptualization  is  the  concept.     There  are 
three  elemental  kinds  of  concepts --nominals,   actions,   and  modifiers. 


1.  Nominals 

Nominals  are  those  things  that  can  be  thought  of  by  them- 
selves without  the  need  for  relating  them  to  other  concepts.     A  word 
that  is  a  realization  of  a  nominal  concept  tends  to  produce  a  picture  of 
that  real-world  item  in  the  mind  of  the  hearer,    so  nominal  concepts  are 
called  PP's  (for  picture  producer).     A  PP  then,   is  the  concept  of  a  gen- 
eral thing,   e.g.,   a  man,    a  book,   a  mountain;  or  of  a  specific  thing-- 
John,   New  York,   or  the  Grand  Canyon. 

2.  Actions 

An  action  is  that  which  a  nominal  can  be  said  to  be  doing.     In 
order  for  a  concept  to  qualify  as  an  action  (ACT)  it  must  be  something 
that  an  animate  nominal  can  do  to  some  object.     Thus,    since  "John  hit 
Mary"  expresses  an  action  that  happened  to  Mary,    "hit"  is  an  ACT.     In 
"John  likes  Mary"  however,   nothing  happens  to  Mary,    so  "like"  is  not 
an  ACT. 

3.  Modifiers 

A  modifier  is  a  concept  that  makes  no  sense  without  the 
nominal  or  action  to  which  it  relates,    and  serves  to  specify  an  attribute 
of  that  nominal  or  action.     Modifiers  of  nominals  are  PA's  (picture 
aiders),   and  modifiers  of  actions  are  AA's  (action  aiders). 

C.         DEPENDENCIES 

Each  of  these  conceptual  categories  (PP,   ACT,    PA,   and  AA)  can 
be  related  in  specified  ways  to  one  another.     These  relations  are  called 
dependencies.     The  rule  of  thumb  in  establishing  dependency  relations 


between  two  concepts  is  whether  one  item  can  be  understood  without 
the  other.     A  governor  can  be  understood  by  itself.     However,  for  a 
conceptualization  to  exist,   even  a  governor  must  be  dependent  on  some 
other  concept  in  that  conceptualization. 

PP's  and  ACT's  are  inherently  governing  categories,  while  PA's 
and  AA's  are  inherently  dependents.     However,   governors  can  also  be 
dependent,   and  in  order  for  a  conceptualization  to  exist,   this  must  be 
the  case  for  at  least  two  governors. 

The  conceptual  base  is  represented  by  a  linked  network  of  con- 
cepts and  dependencies  that  is  called  a  conceptual  dependency  network. 
As  an  example  of  what  such  a  network  looks  like,   consider  the  sentence: 
"John  hit  his  little  dog.  "    "John"  is  the  name  of  an  object,   so  it  repre- 
sents a  concept  which  can  be  understood  by  itself,   and  it  is  thus  a  PP. 
"Hit"  represents  a  concept  of  action.     Each  of  these  concepts  is  neces- 
sary to  the  conceptualization.     Thus,   a  two-way  dependency  exists 
between  them.     That  is,   they  each  act  as  governors  which  can  be 
understood  by  themselves  but  which  must  both  be  present  in  order  to 
form  a  conceptualization.     The  two-way  dependency  is  denoted  by  <     >  . 
The  words  "his"  and  "little"  both  represent  dependent  concepts  in  that  in 
order  to  understand  them  it  is  necessary  to  hold  them  in  waiting  until 
what  they  modify  appears.     "Dog"  is  the  name  of  a  concept  which  is  a 
PP  and  is  therefore  a  governor.     The  PP  "dog"  is  conceptually  related 
to  the  ACT  "hit"  insofar  as  it  cannot  be  understood  with  respeel  to  Hie 
conceptualization  except  in  terms  of  "hit."      This  objective  dependency 
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o 

is  denoted  < .        So  the  network  to  this  point  is: 

o 
John  <£ — >  hit     < dog 

Now  the  dependents  that  were  waiting  for  "dog"  as  governor  can  be 

added.     "Little"  represents  a  PA  that  is  dependent  on  "dog.  "    This  is 

/Js 

called  attributive  dependency  and  is  denoted  by         .     The  concept  given 

by  "his"  would  appear  to  be  dependent  on  "dog"  as  well,   and  it  is,    but 
it  is  not  a  simple  concept.     "His"  is  really  another  syntactic  represen- 
tation of  the  PP  "John"  that  is  being  used  in  the  syntactic  form  that 
indicates  possession.     The  true  relation,   then,   is  one  PP  acting  as  a 
dependent  modifier  to  another  PP.     Prepositional  dependency  between 
two  PP's  is  denoted        with  a  label  indicating  the  type  of  prepositional 
dependency.     (POSS  indicates  that  the  governor  possesses  the  depen- 
dent. )    The  final  network  is: 

John  < — >  hit  < dog 

POSS 
little      John 

D.         CONCEPTUAL  RULES 

The  conceptual  level  is  intended  to  represent  the  concepts  and 
relations  between  concepts  that  underlie  natural  language  utterances. 
There  are  formally  defined  dependency  relations  between  given  cate- 
gories of  concepts,    and  these  relations  are  the  conceptual  rules. 
These  rules,   and  only  these,   make  up  the  formal  organization  of  the 
conceptual  networks  of  the  conceptual  level.     All  of  these  rules  ace 
summarized  in  Fig.    1  at  the  end  of  the  section. 
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1.  Actor -ACT  Dependency 

Since  a  conceptualization  expresses  an  event,  the  heart  of 
any  conceptualization  is  the  relationship  between  the  actor  and  the 
action  in  that  event.     Thus  rule  1  states  that  PP's  can  ACT  and  when 
they  do  there  is  a  mutual  dependence  between  them.     For  example,   in 
the  sentence  "John  hit  Mary,  "  the  two-way  dependency  is  shown  as 

John  <     >hit 

2.  ,  Attributive  Predication 

It  is  possible  to  predicate  an  attribute  about  a  particular  PP. 
This  is  called  an  attributive  conceptualization  and  the  relationship  is 
denoted  by  <      >  .     In  order  for  these  items  to  exist  as  a  conceptualiza- 
tion,  each  is  equally  necessary,   so  the  dependency  is  two-way.     For 
example, 

John  <==^>  tall 

3.  ACT  Membership 

It  is  also  possible  to  predicate  ACT  membership  between 
two  PP's: 

John  <'- — ;>  doctor 

4.  Attributive  Dependency 

It  is  possible  to  refer  to  a  concept  and  an  attribute  of  that 
concept  that  has  already  been  predicated.     In  discourse,    conceptual 
attributes  are  predicated,   either  explicitly  or  implicitly,   before  they 
are  used,   to  differentiate  concepts  of  the  same  linguistic  name.     For 
example,    "the  tall  man"  would  only  be  used  to  differentiate  two  men 
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whose  relative  height  is  either  visually  apparent  or  has  been  previously 
remarked  upon  in  a  predication.     "The  tall  man"  would  be  shown  as 

man  < tall 

5.  Location,   Possession,   and  Containment 

Two  conceptual  objects  in  the  world  can  be  related  to  each 
other  in  various  fashions.     The  three  principle  ones  are  location,   pos- 
session,   and  containment,   and  these  are  marked  on  the         arrows. 
For  example: 

man  book  water 

/fN  /Tk  /TK 

LOC  POSS  CONT 

New  York  John  bucket 

6.  Objective  Dependency 

Rule  6  indicates  objective  dependency.     The  PP  is  related  as 
object  to  the  ACT  which  governs  it.     For  example,   the  sentence  "John 
hit  Mary"  gives  the  network 

john<=!=>  hit  <-^-Mary 
(The  "p"  is  placed  over  the  arrow 4=^  to  indicate  that  the  event  took 
place  in  the  "past.  ") 

7.  Recipient  Case 

The  recipient  case,   which  is  dependent  on  the  ACT  through 
the  object,    is  used  to  denote  the  transition  in  possession  of  the  object 
from  the  originator  to  the  recipient.     A  concept  like  "give"  or  "take" 
obviously  requires  a  recipient  case,   although  the  original  possessor 
or  the  recipient  may  be  only  implied  or  may  be  unknown.     For  example, 
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in  the  sentence  "I  gave  the  man  a  book"  both  the  originator  and  the 

recipient  are  known: 

p                   o                         r  -       '         man 
I  <     >  give  < book       < 


I 


from 

However,   in  the  sentence  "the  man  took  a  book",   the  donor  is  unknown: 

->  man 


man <===>  take  < book    ^ 


R 


-<   (x) 

It  can  be  seen  that  these  two  conceptualizations  look  very 
much  alike.     Conceptually,   the  same  underlying  action  has  occurred-- 
the  transfer  of  an  object  from  one  person  to  another.     The  difference 
lies  only  in  the  direction  of  transfer.     But  by  realizing  that  direction 
can  be  determined  by  comparing  the  actor  with  the  donor  (or  recipient) 
of  the  conceptualization,   it  would  seem  reasonable  to  be  able  to  replace 
"give"  and  "take"  by  a  new  ACT  "TRANS".     For  example,    the  first 
conceptualization  above  would  be  realized  as 

to 


p                          o  R   r-  MAN 

I  <=4>  TRANS    < BOOK   <— 


-<  I 

from 

and  the  second  example  as 

to 

p                        o  H   —  MAN 

MAN  <=> TRANS  < BOOK    <r^- 


-<       SOMEONE 
from 

"Give"  can  then  be  defined  as  "TRANS"  where  actor  and 

donor  are  the  same,   and  "take"  is  "TRANS"  where  actor  and  recipient 

are  the  same. 


Many  other  verbs  besides  "give"  and  "take"  can  be  realized 
as  "TRANS"  plus  other  requirements  (e.g.,    "buy",    "sell",   and  "steal"), 
but  "give"  and  "take"  are  the  only  ones  being  presently  considered.     It 
would  seem  that  being  able  to  map  many  concepts  into  a  single  concept 
without  losing  any  information  would  greatly  simplify  the  conceptual 
network. 

8.       Instrumental  Case 

■  Rule  8  shows  the  instrumental  case.     The  instrument  of  an 
ACT  is  a  conceptualization,   and  represents  the  means  by  which  the 
main  action  was  completed.     Every  ACT  requires  an  instrument,   but 
more  often  than  not  the  instrument  is  implied  rather  than  explicitly- 
stated.     For  example,   in  the  sentence  "John  hit  Mary",   the  instrument 
is  not  stated.     But  in  the  sentence  "John  hit  Mary  by  throwing  a  stick", 
the  entire  instrumental  conceptualization  is  present.     The  conceptual 
network  for  this  sentence  would  be 

John 

John  <-*-^>  hit  < Mary  < \      p 

throw 


O 


stick 

9.       Directive  Case 

The  directive  case  indicates  that  PP's  may  serve  as  the 
directional  indicators  of  a  directional  action.     In  the  example  above, 
the  action  "throw"  takes  the  directive  case  even  though  there  is  no 
explicit  mention  of  direction.     Tn  order  for  the  sentence  i"  i  te] 

express  the  conceptualization,   if  would  have  to  be  "John  hit  Mary  by 
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throwing  a  stick  at  her",   and  the  network  would  be 


John  <===>  hit  < Mary  < 


John 
thr  ow 


stick     <r 


D 


->    Mary 


-<     John 


10.    Causality 

Causality  is  denoted 


r 


,   and  indicates  that  the  governing 
conceptualization  caused  the  dependent  conceptualization  to  happen. 
As  an  example,   the  sentence  "John  was  sad  because  Mary  left  him" 
yields  the  network 

->    (x) 


P  D 

Mary<^==>  go      < — 


— < John 
John<^=b>  sad 

Another  example  is  the  sentence  "Sam  flew  his  plane  to  New  York". 

The  network  for  this  sentence  is 

Sam  /  P>  do 


plane     <_L_>   fly    < — 
POSS 

Sam 


->  New  York 


<  (x) 


The  "do"  in  the  network  is  a  dummy  standing  tor  an  ACT  which  was 
not  stated--in  tin's  case,   the  actions  which  arc  required  to  pilot  a 
plane. 
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11.    Time  of  Conceptualization 

This  rule  relates  another  conceptual  category  T  (for  concepts 
like  "yesterday"  and  "3  o'clock")  and  a  conceptualization.     The  time  of 
something  modifies  the  entire  conceptualization  and  not  any  particular 
item  in  it,   thus  it  is  the  time  of  the  joining  together  of  the  actor  and 
the  ACT. 

yesterday 

4/ 
John       -^U    hit    <r^—    Mary 

12.      Concurrent  Conceptualizations 

This  rule  is  similar  to  rule  11,   except  that  here  am  entire 

conceptualization  is  the  time  of  another.     This  dependency  is  usually 

realized  in  English  by  "while".     "Mary  cried  while  John  hit  her" 

yields  the  conceptual  network 

John  ^2=^  hit  <^—  Mary 


Mary< — >  cry 
13.       Location  of  Conceptualization 

Rule  13  indicates  that  conceptualizations  must  occur  some- 
place.    For  example,    "Mary  hit  John  in  the  park"  yields 

park 


n                  ° 
Mary     <-^->  hit    < John 
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14.      Change  of  State 

The  final  rule  expresses  the  fact  that  an  object  in  the  world 
has  changed  its  state  in  some  way.     The  sentence  "John  grew  the  plants" 
would  yield  the  network 


John  <r      >  do 


plants  <^ 


->     height  y 


(where  y  >  x) 


<     height  x 


("Grow"  is  a  state  change,    and  not  an  action  that  someone  can  perform, 
hence  the  dummy  "do".  ) 
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III.     NLP--NATURAL  LANGUAGE  PROCESSOR 

NLP  is  a  general  language  processing  system  consisting  of  an 
IBM  360  Fortran  program  and  a  "rule  language.  "    Sets  of  "decoding" 
and  "encoding"  rules  must  be  written  to  specify  how  processing  is  to 
be  done  for  a  given  application.     Decoding  rules  specify  how  input  text 
is  to  be  processed  to  produce  an  entity -attribute -value  data  structure 
(called  an  Internal  Data  Structure--IDS),   and  encoding  rules  specify 
how  to  produce  output  text  from  the  IDS. 

Several  of  the  Fortran  routines  act  as  a  "compiler"  for  the  rule 
language.     They  are  used  to  process  decoding  and  encoding  rules  and 
convert  them  into  their  internal  representation.     Other  routines,  which 
are  used  at  execution  time,   perform  encoding  and  decoding  according 
to  this  information. 

A  complete  description  of  NLP  is  found  in  Ref.    1.     This  section 
presents  an  overview  of  NLP,  with  some  portions  coming  directly  from 
Ref.   1. 

A.         ENTITY -ATTRIBUTE- VALUE  STRUCTURE 

The  IDS  is  designed  to  hold  information  (produced  from  input 
text)  in  a  language-independent  form.     All  of  the  information  is  held  in 
"records",  which  arc  just  lists  of  attribute -value  pairs.     Some  records 
represent  physical  entities,   such  as  "car",   "ball",   or  "man".     Ot) 
represent  abstract  entities  such  ctior   .     There  i:;  a  record  for  each 
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word  in  the  structure,   and  the  values  of  the  attributes  of  the  record 
specify  the  characteristics  of  the  word.     For  example,   the  record  for 
the  word  "arrive"  could  have  a  part- of- speech  attribute  with  the  value 
"verb",   and  a  type  attribute  with  the  value  "intransitive".     There  is 
also  a  record  for  each  concept  in  the  structure.     The  attribute  which 
relates  concepts  to  other  concepts  is  called  the  SUPerset  attribute. 
For  example,   the  SUP  attribute  of  a  record  representing  the  concept 
"car"  might  point  to  another  record  representing  the  concept  "vehicle, 
and  the  SUP  of  vehicle  might  be  "entity.  "    This  SUP  chain  specifies 
that  "car"  is  a  type  of  "vehicle,  "  and  "vehicle"  is  a  type  of  "entity.  " 

The  use  of  records  is  not  limited  to  the  representation  of  just 
single  words  and  concepts,  however.     A  record  may  describe  the 
information  contained  in  an  entire  "conceptualization.  "    For  example, 
the  sentence  "John  gave  a  book  to  the  girl.  "  could  be  described  by  the 
record 


SUP 

give 

AGENT 

John 

GOAL 

book 

RECIPIENT 

girl 

PAST 

There  are  many  attributes  which  action  records  such  as  these 
might  have,    to  hole]  the  informal  ion  prosonl    in  :i   nnuvpl  ual  i  /at  ion. 
These  include  AGENT,    GOAL,    LOCATION,     CTME,    SOURCE,    DESTINA- 
TION,  DONOR,    KKni'iiA'T.   I5EASON.    INSTRUMENT,   and  CONCUR- 
RENT.   The  values  of  some  of  these  are  entities  (i.e.,    PP's  In  Con- 
ceptual Dependency  terminology),   and  the  -  ther 
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action  records.     Entities  are  represented  by  records  which  have  attri- 
butes such  as  LOCATION,    COLOR,    SIZE,    QUANTITY,   WEIGHT, 
SPEED,    OWNER,    and  CONTAINER. 

A  special  type  of  attribute  used  in  NLP  is  the  "indicator,  "  so 
called  because  it  indicates  the  existence  of  a  particular  condition.     For 
example,    a  record  describing  the  word  "cars"  could  be  said  to  have 
its  PLURal indicator  "on."  In  the  example  record  given  above,   PAST 
is  an  indicator. 

Most  of  the  information  about  words  and  concepts  is  initially 
entered  into  the  system  by  means  of  named  record  definitions.     A 
typical  named  record  definition  is 

TAKC EVENT',    E,    ES,    ING,    EN,    ER,    TRANS) 
This  example  defines  a  record  named  "TAK"  which  has  a  SUP  attribute 
pointing  to  the  named  record  'EVENT'  and  has  the  indicators  E,    ES, 
ING,    EN,    and  ER,    indicating  that  'TAK'  can  have  these  endings.     It 
also  has  the  indicator  TRANS,   meaning  that  'TAK'  is  a  transitive 
verb.     All  these  indicators  provide  information  about  the  word  "take", 
while  the  SUP  attribute  provides  information  about  the  concept,  "take". 

The  most  important  feature  of  named  records  is  that  they  may  be 
referred  to  by  name  both  from  other  named  records  and  from  the  en- 
coding and  decoding  rules. 

B.         THE  RULE  LANGUAGE 

The  purpose  of  NLP,   basically,   Is  to  converl  Information  from 
one  form  to  another.    The  two  differenl  forms  of  Information  thai  it 


works  with  are  records  and  character  strings.     The  process  of  con- 
verting character  strings  into  a  record  structure  representing  the 
meaning  of  the  input  is  called  "decoding".     The  inverse  process  is 
called  "encoding". 

The  processing  done  in  the  system  is  specified  by  sets  of  rules 
written  in  a  rule  language.     Decoding  rules  specify  how  input  character 
strings  are  to  be  converted  to  records,    and  encoding  rules  specify  how 
records  are  to  be  converted  to  character  strings.     Either  kind  of  rule 
can  be  used  to  specify  how  records  can  be  converted  to  other  records. 
For  the  application  being  reported  on  in  this  thesis,   encoding  rules  are 
used  to  specify  all  of  the  processing.     A  complete  explanation  of  the 
rule  language  and  description  of  the  application  of  the  rules  is  given 
in  Ref.    1. 

A  rule  consists  of  a  left  part  and  a  right  part,    separated  by  an 
arrow  (-->).     In  general,   the  left  part  specifies  what  conditions  must 
exist  in  order  to  apply  the  rule,    and  the  right  part  tells  what  to  do  when 
the  rule  can  be  applied. 

As  typical  examples  of  the  rules  written  in  the  NLP  rule  language, 
consider  the  following: 

(1)  VERB('BE')         VERBPH(PASTPART)         --> 

VERBPIRPASSIVE,    VFORM=VFORM(VERB)) 

(2)  VERBPH(PASSIVE)     --> 

VERBCBE1,   VFORM=VFORM(VER  BPH)) 
VER  BTTK-  PASSIVE,    -VFORM,    PAST  I  'AH  T) 
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The  first  example  is  a  decoding  rule.     It  says  that  any  form  of  the  verb 
"be"  can  be  put  together  with  a  past-participle  verb-phrase  to  create  a 
new  verb-phrase  that  has  all  the  characteristics  of  the  old  verb-phrase, 
except  that  it  is  passive  and  has  the  same  verb-form  (e.g.,    present- 
third-person-singular)  as  the  verb  on  the  left.     For  example,   this  rule 
would  apply  to  the  phrase  "is  unloaded". 

The  second  rule  is  an  encoding  rule,   and  says  that  a  passive  verb- 
phrase  is  to  be  expanded  to  a  verb  which  is  a  form  of  "be"  (with  the 
particular  verb-form  coming  from  the  verb-phrase),   followed  by  a  new 
verb-phrase  which  has  all  the  characteristics  of  the  old  verb-phrase, 
except  that  it  is  not  passive  and  it  has  past-participle  in  place  of  its 
old  verb-form.     The  use  of  encoding  rules  will  be  seen  in  the  next 
section. 
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IV.     IMPLEMENTATION 

This  section  describes  the  information  structure  devised  for 
representing  a  Conceptual  Dependency  conceptualization  in  NLP  and 
describes  the  processing  required  to  convert  a  conceptualization 
from  its  usual  NLP  form  into  this  form  and  vice  versa. 

A.         THE  INFORMATION  STRUCTURE 

A  conceptualization  is  represented  in  the  Internal  Data  Structure 
of  NLP  as  a  set  of  records  representing  the  concepts  of  that  concep- 
tualization.    The  action  is  considered  to  be  the  most  important  building 
block  in  the  structure.     Consequently,   there  is  a  special  record 
('ACTNLIST')  in  which  there  are  attributes  pointing  to  each  action 
record.     Each  of  these  action  records  has  attributes  linking  the  action 
to  the  other  concepts  in  the  conceptualization.     Figure  2  shows  a  graphic 
portrayal  of  the  IDS  representation  for  the  conceptualization  "John  gave 
Mary  a  red  book,  "  and  the  records  Rl  and  R2  that  would  be  created  by 
NLP  during  the  decoding  of  the  English  sentence.     The  other  records 
would  have  been  part  of  the  named  records  already  in  the  system. 
Additionally,   Rl  would  be  pointed  to  by  'ACTNLIST'. 

In  the  work  done  here  it  was  decided  to  represent  each  relation 
in  a  conceptualization  of  Conceptual  Dependency  theory  by  an  NLP 
record  with  specified  attributes.     For  example,   a  record  representing 
a  rule  1  relation  would  have  a  SUP  attribute  of  'REL1'  and  attributes 
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PP  and  ACT  to  point  to  the  associated  concepts.     Similarly,   a  rule  7 
relation  yields  a  'REL7'  with  attributes  of  ACT,    PPDON,    and  PPREC. 
Figure  3  shows  the  dependency  relations  and  the  attributes  specified 
for  each  relation.     In  addition  to  these  attributes,   certain  other  attri- 
butes defined  in  NLP  may  be  used  as  necessary  (e.g.,    PAST,    PLUR). 

The  three  parts  of  rule  5  (LOCation,   CONTainment,   and  Poss- 
ession) were  defined  separately  because  it  was  felt  that  the  preposition 
associated  with  location  is  extremely  important  to  the  conceptualization, 
whereas  it  is  usually  understood  in  the  case  of  possession  and  contain- 
ment.    For  example,   the  sentence  "John  was  sitting  at  the  bar"  takes 
on  entirely  different  meanings  when  the  location  preposition  "at"  is 
replaced  by  "behind",    "on",    "above",   or  "under.  " 

Figure  4  shows  the  representation  of  an  example  conceptualization 
in  the  structure  developed  for  the  dependency  relations.     The  conceptual- 
ization shown  there  is  the  same  as  that  in  Figure  2.     The  record  R EL- 
LIST  has  pointers  to  each  of  the  REL  records,   and  the  attribute  LASTREC 
says  that  attribute  14  (@14)  is  the  final  pointer  attribute  on  the  list.     The 
SUP  of  each  REL  record  indicates  which  conceptual  relation  the  record 
represents,    and  the  other  attributes  point  to  the  associated  concepts.     In 
the  figure,   records  Rl,   R2,  R3,    and  R4  represent  the  concepts  involved. 
The  SUP  attributes  of  records  D1-D5  point  to  the  same  named  records 
which  are  shown  in  Figure  2  ('BOOK',    'JOHN',    'MARY',   etc.).     These 
named  records  were  left  off  this  drawing  for  lack  of  space. 
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B.         ATTRIBUTE-TO-RELATION  PROCESSING 

This  section  deals  with  the  conversion  of  an  NLP  entity-attribute- 
value  structure  to  a  relation  structure  of  Conceptual  Dependency  theory. 
The  NLP  encoding  rules  which  do  the  processing  are  listed  in  Fig.    5. 

The  basic  reasoning  underlying  the  development  of  these  rules 
was  that  the  dependency  relations  could  be  determined  by  examining  the 
records  in  the  NLP  representation  of  a  conceptualization.     For  example, 
all  of  the  relations  involving  ACT's  could  be  constructed  using  action 
records,  which  are  all  conveniently  referenced  in  the  single  record 
'ACTNLIST'.     The  rest  of  the  relations  involve  either  PP's  or  PA's. 
The  PP's  are  available  on  one  of  two  lists--' MO BLIST'  or  "STALIST' 
(lists  constructed  by  NLP  during  decoding  which  point  to  all  entities, 
both  MO  Bile  and  STAtionary).     The  PA's  can  be  found  as  attributes  of 
the  records  pointed  to  by  one  of  the  three  lists  mentioned. 

In  general,   the  processing  is  done  as  follows:    a  record  is  taken 
from  the  stack  of  records  to  be  processed,    and  examined  for  attributes 
needed  to  construct  relation  records.     If  one  is  found,   the  appropriate 
REL  record  is  built  and  put  on  the  'RELLIST'.     The  original  record, 
minus  the  attribute  that  caused  the  creation  of  this  REL  record,   is  put 
back  on  the  top  of  the  stack  to  be  processed  again  for  other  possible 
relations.     By  erasing  these  attributes  as  relations  are  found,   multiple 
processing  by  a  given  rule  is  avoided.     The  records  must  be  put  back 
on  the  stack  if  they  have  not  been  "examined"  by  all  of  the  rules, 
because  they  may  represent  more  than  one  relation.     When  a  record  is 
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found  to  have  none  of  the  attributes  necessary  for  construction  of  a 
relation,   it  is  deleted  from  the  processing  stack,   and  the  next  record 
is  taken  from  the  top  of  the  stack  and  processed.     The  operation  of  the 
stack  is  basic  to  the  encoding  process  of  NLP  and  is  described  in  detail 
in  Ref.    1. 

The  first  three  processing  rules  shown  in  Fig.    5  result  in  copies 
of  the  records  in  the  three  lists  ('ACTNLIST',     'MOBLIST',   and 
'STALIST')  being  placed  on  the  processing  stack.     Additionally,   each 
of  these  records  is  given  another  attribute  ('ENTY')  which  points  to 
the  original  record.     Rules  4,    5,   and  6  cause  each  record,   as  it  comes 
off  the  stack,   to  be  classified  as  an  action  record  (ACTRECC)  or  an 
entity  record  (ENTRECC).     Rules  7  through  30  recognize  the  presence 
of  dependency  relations  and  create  appropriate  RELation  records 
(RELREC),    and  rule  31  causes  the  newly  created  RELRECs  to  be 
placed  on  the  list  'RELLIST'. 

1.      Rule  Processing  Example 

To  illustrate  the  application  of  the  processing  rules,    con- 
sider the  sentence 

"John  ate  a  red  apple.  " 
The  following  records  would  be  present  in  the  IDS  as  a  result- of  the 
decoding  of  this  sentence  by  NLP: 

Rl  ('EAT',  AGENT  =  'R2\  GOAL  =  'R3',  PAST) 

R2  ('JOHN') 

R3  ('APPLE',  COLOR  =  'RED') 
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Since  Rl  is  an  action  record,   it  would  be  pointed  to  by  '  ACTNLIST'. 
R2  would  be  on  'MOBLIST',   and  'STALIST'  would  have  an  attribute 
pointing  to  R3.   Assume  that  each  list  has  just  the  one  record  in  it, 
pointed  to  by  attribute  11  (@11)  of  the  list,    and  that  attribute  LASTREC 
of  each  list  is  also  11.     Processing  would  proceed  as  follows.     Rule  1 
would  create  3  records,   each  named  RECLISTC.     The  first  would 
consist  of  a  copy  (%)  of  'ACTNLIST',    an  attribute  called  LIST  pointing 
to  the  list  'ACTNLIST',   and  an  attribute  LC  set  to  11.     These  three 
records  would  be  placed  on  the  processing   stack  in  inverse  order, 
i.e.,   the  first  created  is  on  top,   the  second  one  created  is  next,    and 
so  on.     (If  more  than  one  record  is  created  by  any  given  rule,   the 
records  are  always  put  on  the  stack  in  this  manner. )    The  stack  now 
consists  of 

STACK 

51  RECLISTC(@11  =  'R1\    LASTREOll,    LIST -'ACTNLIST1,    LC  =  11) 

52  RECLISTC(@11  =  'R2',    LASTREC  =  11,    LIST  =  ' MOBLIST',    LC  =  11) 

53  RECLISTC(@11  =  'R3',    LASTREC  =  11,    LIST  =  ' STALIST',    LC  =  11) 

(The  designators  SI,    S2,    and  S3  are  not  a  part  of  the  rules --they  are 
being  used  here  to  indicate  the  order  of  creation  of  the  rules.  ) 

The  next  step  in  processing  is  to  take  SI  off  the  stack  and 
examine  it  to  see  which  of  the  rules  applies  to  it.     Rules  2  and  3  both 
apply  to  records  with  the  name  'RECLISTC.     Rule  2  says  that  if  a 
'RECLISTC  record  has  an  attribute  LC  which  is  less  than  or  equal  in 
value  to  the  LASTREC  attribute,   then  execute  the  "creation  specifica- 
tions" on  the  right  of  the  arrow  (-->).     Since,    in  SI,    LC  is  equal  to 
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LASTREC,   the  condition  is  satisfied,    so  the  rule  is  executed,   resulting 
in  a  RECC  record. 

The  first  creation  element  is 

%@LC(RECLISTC)(RECLISTC) 
This  says  "copy  the  record  pointed  to  by  attribute  (LC(RECLISTC))  of 
RECLISTC,  "  where  "RECLISTC"  refers  to  the  record  being  processed 
(SI,   in  this  case).     LC  of  SI  is  11;     attribute  11  of  SI  is  'Rl',   so  the 
RECC  begins  as  a  copy  of  'Rl'.     The  second  creation  element  says  to 
copy  the  LIST  attribute  of  the  RECLISTC  into  this  record,    and  the  third 
copies  the  LC  attribute  from  SI.     The  last  specification  says  to  set  the 
ENTY  attribute  in  this  record  equal  to  the  value  of  @LC(RECLISTC) 
(RECLISTC).     Again,    LC(S1)  is  11,   and  @11  of  SI  is  'Rl',    so  the  ENTY 
attribute  of  RECC  points  to  Rl. 

Rule  2  also  says  to  put  the  current  RECLISTC  record  back 

on  the  stack  and  increment  its  LC  attribute.     At  this  point,   the  stack 

consists  of 

S4     RECCCEAT',   AGENT  =  'R2',   GOAL='R3\    PAST,    LIST  = 
'ACTNLIST',    LC  =  11,    ENTY  =  'R1') 

51  RECLISTC(@11  =  'R1',    LASTREC  =  11,    LIST  =  ' ACTNLIST', 

LC  =  12) 

52  RECLISTC(@11  =  'R2\    LASTREC  =  11,    LIST  =  'MOBLIST', 

LC  =  11) 

53  RECLISTC(@11  =  'R3',    LASTREC  =  11,    LIST  =  'STALIST\ 

LC-11) 

Next,    S4  is  taken  from  the  stack,    and  the  rules  are  searched  to  find 
one  that  applies  to  a  "RECC"  record.     The  rules  are  always  searched 
in  order,    so  that  the  first  one  that  applies  will  be  used.     It  is  seen  in 
Fig.    5  that  rules  4,    5,   and  6  are  the  only  possible  rules  that  can  be 
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applied  at  this  point.     The  condition  that  must  be  met  for  rule  4  to  be 

applicable  is  that  the  LIST  attribute  of  the  RECC  be  equal  to  'ACTNLIST', 

and  this  condition  is  satisfied  in  S4,   so  a  record  named  ACTRECC  is 

created  as  a  copy  of  the  current  RECC  and  put  on  the  stack: 

S5      ACTRECC  ('EAT',    AGENT='R2',   GOAL  =  'R3',    PAST, 
LIST='ACTNLIST\    LC  =  11,    ENTY  =  'R1') 

51  (same  as  before) 

52  (same  as  before) 

53  (same  as  before) 

As  was  mentioned  earlier,   the  purpose  of  the  first  six  rules 
is  to  construct  ACTRECCs  and  ENTRECCs  that  can  be  examined  for 
dependency  relations.     It  was  also  stated  that  ACTRECCs  will  yield 
those  relations  involving  ACTs,    and  that  any  relations  not  involving 
ACTs  must  come  from  the  ENTRECCs.     The  rest  of  the  rules  in  Fig.    5 
are  divided  into  two  groups:    those  that  process  ACTRECCs  and  those 
that  process  ENTRECCs.     The  application  of  these  rules  will  be  illus- 
trated by  continuing  with  the  example. 

The  ACTRECC  (S5)  is  removed  from  the  stack  and  the  rules 
are  searched  (from  the  beginning)  for  one  in  which  the  condition  speci- 
fication (if  any)  is  satisfied.     The  first  rule  that  processes  ACTRECCs 
is  rule  7,    but  it  specifies  that  the  current  ACTRECC  must  have  a 
DONOR  attribute.     (Asking  if  a  particular  attribute  "exists"  means 
"does  this  attribute  have  a  value  other  than  zero?")    Since  S5  does  not 
"have"  a  DONOR  attribute,   the  rule  search  is  continued  until  rule  13 
is  reached.     S5  does  have  an  AGENT  attribute,   so  this  rule  applies. 
A  record  called  RELREC  is  created  with  four  attributes.     The  first  is 
a  SUP  attribute  of  'REL1'  indicating  that  this  new  record  represents  a 
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type  1  dependency  relation  (PP  <£==>  ACT).     The  ACT  attribute  is  set 
equal  to  the  ENTY  attribute  of  the  current  ACTRECC  by  the  creation 
element  "ACT=ENTY( ACTRECC)",   and  the  PP  attribute  is  set  to  the 
AGENT  attribute  of  S5.     The  fourth  creation  element,    "ONEPTR 
(ACTRECC)-SEG",    says  to  set  the  ONEPTR  attribute  of  the  current 
ACTRECC  to  point  to  the  record  being  created,   i.e.,   this  RELREC. 
(The  reason  for  this  "back-pointer"  will  become  apparent  when  rules 
15  and  16, are  discussed.)    The  final  creation  element  says  to  make 
the  VERBPHIND  ("verbphrase  indicator")  attribute  of  this  RELREC 
equal  to  the  VERBPHIND  of  the  ACTRECC.     The  VERBPHIND  includes 
all  indicators,   such  as  PAST,    FUTURE,    SING,    and  PLUR,   that 
describe  verbs.     In  this  example,   the  only  one  of  these  indicators  that 
is  "on"  in  S5  is  PAST,   so  Pi\ST  is  set  in  RELREC.     (If  none  of  the 
indicators  of  this  form  had  been  "on",   i.e.,    VERBPHIND  was  zero, 
then  the  VERBPHIND  in  this  RELREC  would  have  been  set  equal  to 
zero,   also.     In  that  case,   it  would  be  said  that  there  was  "no"  YER  B- 
PHIND  indicator  present.  ) 

This  completes  the  creation  of  the  RELREC.     The  other  part 
of  this  rule  "ACTRECC(-AGENT,    -AGENT(ENTY),    -VERBPHIND 
(ENTY))"  says  to  "erase"  the' AGENT  attribute  of  the  current  ACTRECC 
(i.e.,    set  it  to  zero),    and  erase  the  AGENT  and  VERBPHIND  attributes 
of  the  record  pointed  to  by  the  ENTY  attribute  of  this  ACTRECC.     ENTY 
of  S5  points  to  Rl,    so  now  Rl  looks  like  this: 

Rl  ('EAT',    GOAL  =  'R3') 
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The  RELREC  that  was  just  created  and  the  "new  version"  of  S5  are 

returned  to  the  stack: 

S6      RELREC  ('REL1',   ACT  =  'R1\   PP='R2',    PAST) 
S5      ACTRECC  ('EAT',    GOAL='R3',    PAST,    LIST  =  'ACTNLIST', 
LC  =  11,    ENTY  =  'R1',    ONEPTR  =  'S6') 

51  (Same  as  before) 

52  (Same  as  before) 

53  (Same  as  before) 

It  should  now  be  apparent  why  "AGENT"  was  erased  from  S5:    the  next 
time  it  came  off  the  stack  with  an  AGENT  attribute,   it  would  be  proc- 
essed by  rule  13  again,   and  put  back  on  the  stack--an  "infinite  loop.  " 
The  reason  for  erasing  the  AGENT  attribute  from  Rl  will  be  seen  later. 

The  next  step  in  the  process  is  to  remove  S6  from  the  stack 
and  find  an  applicable  rule.     The  only  rule  that  handles  a  RELREC  is 
rule  33,   and  since  there  are  no  conditions  to  be  met  before  processing, 
rule  33  is  executed.     The  "NULL"  on  the  right  says  that  after  executing 
the  creation  elements  (if  there  are  any),    do  not  put  the  current  record 
back  on  the  stack. 

The  creation  elements  in  rule  33  sa}r  to  increment  the  LAST- 
REC  attribute  of  the  record  'RELLIST'  (it  was  set  to  the  value  of  the 
last  attribute  being  used),   make  this  new  attribute  point  to  the  current 
RELREC,    S6,   and  increment  LASTREC.     It  was  mentioned  earlier  in 
the  discussion  that  'RELLIST'  is  used  as  a  list  to  keep  track  of  the 
relation  records. 

The  next  record  on  the  stack  is  S5,   which  is  taken  off  to 
become  the  "current"  record.     This  time,   in  the  rule  search,   the 
condition  specification  of  rule  13  is  not  satisfied,    since  S5  no  longer 
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has  an  AGENT  attribute.     However,   S5  does  have  a  GOAL  attribute,    so 

rule'  14  is  executed,    resulting  in  a  RELREC  with  a  SUP  of  'REL6',   an 

ACT  pointing  to  Rl,    and  a  PPOBJ  pointing  to  the  GOAL  of  the  ACTRECC, 

R3.     (This  RELREC  represents  the  relation  EAT   <e^- APPLE.  )    Then  the 

GOAL  attributes  are  erased  from  S5  and  Rl,   and  the  'REL6'  just  created 

is  put  on  the  stack,   along  with  S5. 

S7      RELREC  ('REL6',    ACT  =  'R1',    PPOBJ  =  'R3') 
S5      ACTRECC  ('EAT',    PAST,    LIST=' ACTNLIST', 
LC  =  11,    ENTY  =  'R1',    ONEPTR  =  'S6') 

51  (Same  as  before) 

52  (Same  as  before) 

53  (Same  as  before) 

S7  is  taken  off  the  stack  and  rule  33  is  executed,   adding  the 
new  relation  record  to  'RELLIST'. 

When  the  ACTRECC  comes  off  the  stack  and  the  rules  are 
searched,   it  is  found  that  none  of  the  ACTRECC  rules  that  specify  a 
particular  condition  (rules  7-22)  are  satisfied,   which  means  that  the 
current  ACTRECC  is  no  longer  needed- -it  has  supplied  all  the  informa- 
tion it  can  supply.     Rule  23  is  executed  and  takes  care  of  this  situation 
by  not  putting  the  ACTRECC  back  on  the  stack.     The  stack  has  now  been 
reduced  to 

51  RECLISTC  (@11  =  'R1',    LASTREC  =  11,    LIST  =  ' ACTNLIST',    LC  =  12) 

52  RECLISTC  (@ll='E2'f    LASTREC  =  11,    LIST  =  'MOBLtST\    LC  =  11) 

53  RECLISTC  (@11  =  'R3',    LASTREC  =  11,    LIST  =  'STALIST\    LC  =  11) 

SI  is  the  next  record  off  the  stack.     This  time  rule  2  does  not 


apply,   because  LC  is  greater  than  LASTREC,    so  rule  3  is  executed  and 
SI  is  deleted  from  processing.     If  'ACTNLIST'  had  more  than  one  record, 
SI  would  go  back  on  the  stack  after  the  next  'ACTRECC  had  been  created, 
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and  the  looping  process  would  continue  until  all  of  the  action  records 
on  'ACTNLIST'  had  been  processed  and  all  of  the  relations  involving 
ACTs  had  been  created  and  added  to  'RELLIST1. 

S2  would  be  processed  next,   and  after  rules  2  and  5  had 
created  an  ENTRECC,   the  stack  would  look  like  this: 

58  ENTRECC  ('JOHN',    LIST  =  'MOBLIST',    LC=11,    ENTY='R2') 

52  RECLISTC  (@11  =  'R2\    LASTREC  =  11,    LIST='MOBLIST',    LC  =  12) 

53  RECLISTC  (@11  =  'R3',    LASTREC  =  11,    LIST  ='STALIST',    LC=11) 

, When  S8  comes  off  the  stack,   it  would  ignore  the  first  23  rules 
because  they  are  not  concerned  with  ENTRECCs.     The  next  eight  rules 
would  not  apply  because  S8  has  none  of  the  attributes  they  require,    and 
rule  32  would  prevent  this  ENTRECC  from  being  considered  again  for 
processing. 

S2  comes  off  the  stack  again,    but  is  discarded  because  LC  is 
greater  than  LASTREC,   leaving  only  S3  to  be  processed.     Record  S3 
yields  an  ENTRECC  which  is  placed  on  the  stack: 

59  ENTRECC  ('APPLE',    COLOR  =  'RED',    LIST  =  'STALIST', 

LC  =  11,    ENTY  =  'R3') 
S3      RECLISTC  (@11  =  'R3',    LASTREC  =  11,   LIST  =  'STALIST',    LC  =  12) 

When  S9  is  removed  from  the  stack  and  the  rules  are  searched, 
it  is  found  to  satisfy  the  condition  of  rule  25,   i.e.  ,   it  is  an  ENTRECC  with 
a  COLOR  attribute.     The  first  creation  specification  says  to  build  a 
record  ('RECORD')  with  a  SUP  of  'RED',   and  set  an  attribute  (R  P)  in  a 
record  called  MEM  to  point  to  this  'RECORD'.     (For  this  example, 
assume  that  'RECORD'  is  R4.  ) 

Then  a  RELREC  is  created  with  a  SUP  of  'REL4',    a   PP  equal 
to  'R3'  (the  ENTY  of  ENTRECC),    and  a  PA  attribute  equal  to  RP  (MEM), 

42 


which  in  turn  is  pointing  to  the  RECORD  just  created.     The  RP(MEM) 

is  now  erased,  having  served  its  purpose  as  a  temporary  pointer.     The 

reason  it  was  needed  is  that  the  relation  structure  utilizes  a  different 

form  of  record  for  these  attributes  than  does  the  IDS,   as  may  be  seen 

in  Figs.   2  and  4.     For  example,   the  IDS  uses  a  named  record  'RED', 

whereas  the  relation  structure  needs  a  record  whose  SUP  attribute 

points  to  the  named  record  'RED',    so  this  record  must  be  created  by 

the  rule,   hence:    R4  ('RED'). 

Rule  25  then  erases  the  COLOR  attribute  from  S9  and  R3, 

and  places  S9  back  on  the  stack  along  with  the  new  RELREC: 

S10      RELREC  ('REL4',    PP='R3',    PA='R4') 

S9         ENTRECC  ('APPLE',    LIST  =  'STALIST',    LC  =  11,    ENTY  = 

'R3') 
S3         RECLISTC  (@11  =  'R3',    LASTREC  =  11,    LIST  =  'STALIST\ 

LC  =  12)       . 

In  processing  the  remainder  of  the  stack,   it  can  be  seen  that 
S10  is  put  on  the  relation  list,   and  S9  and  S3  are  both  deleted,   thus 
clearing  the  stack  and  ending  the  processing.     The  following  records 
have  been  created: 

RELLIST  (@11  =  'S6',    @12  =  'S7',    @13  =  ,S10',    LASTREC-13) 

56  ('REL1',    ACT  =  'R1',    PP  =  'R2',    PAST) 

57  ('REL6',    ACT  =  'R1',    PPOBJ='R3') 
S10  ('REL4',    PP='R3',    PA='R4') 

R4  ('RED') 

During  the  processing  of  the  records,   each  time  an  attribute 

was  erased  from  an  ACTRECC  or  ENTRECC,   that  same  attribute  was 

erased  from  the  original  record  pointed  to  by  the  ENTY  attribute  of  the 

ACTRECC  or  ENTRECC.     As  a  result  the  original  records  now  look 

like  this: 
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Rl      ('EAT') 
R2      ('JOHN') 
•     R3     ('APPLE') 

They  have  been  converted  to  records  that  express  only  the  concepts 
in  the  original  sentence,  which  is  exactly  what  is  needed  by  the  depen- 
dency relation  structure. 

Using  S6,    S7,    and  S10  along  with  the  records  they  reference, 
the  dependency  network  can  be  constructed: 

R2^iU>Rl  <-^-R3 

R4 
Substituting  the  "names"  of  the  named  records  yields: 

JOHN<^=>EAT«^—  APPLE 


RED 
which  can  be  recognized  as  the  same  conceptualization  represented  by 
the  English  sentence  at  the  beginning  of  this  example. 
2.       Explanation  of  Remaining  Rules 

The  remainder  of  this  section  will  deal  with  the  explanation 
of  the  processing  rules  in  Fig.   5  that  have  not  yet  been  mentioned. 

Rules  7  and  8  deal  with  the  recipient  case  of  the  dependency 
relations.     In  the  discussion  of  relation  7  a  new  conceptual  ACT 
"TRANS"  was  defined.     It  was  stated  that  the  direction  of  transition 
could  be  determined  by  noting  the  relation  between  the  actor  and  the 
donor  or  recipient.     For  the  concept  "give",  the  actor  and  the  donor 
are  the  same,    and  for  "take"  the  actor  and  the  recipient  are  the  same. 
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However,   in  an  action  record,    since  the  agent  is  already  specified, 
only  the  donor  or  recipient  need  be  specified  to  complete  the  conceptual 
relation.     So  when  an  ACTRECC  satisfies  the  condition  of  rule  7  (i.e.  , 
has  a  DONOR),   it  is  known  that  the  AGENT  is  the  RECIPient.     The 
verb  is  changed  to  'TRANS'  and  the  required  attributes  for  a  'REL7' 
are  created:    ACT  is  set  to  the  original  record  which  now  has  a  SUP 
of  'TRANS';  the  PPDON  attribute  points  to  the  DONOR  specified  by  the 
ENTY  of  the  current  ACTRECC;  and  PPREC  is  set  to  the  AGENT.     Then 
the  DONOR,  attribute  is  cleared  from  the  ACTRECC  and  its  ENTY,   and 
the  ACTRECC  is  placed  back  on  the  stack. 

Similarly,  if  the  ACTRECC  has  a  RECIP  attribute,  then  rule 
8  applies.  Here,  however,  the  PPDON  is  set  to  AGENT  and  PPREC  is 
the  RECIPient. 

Rules  9  and  12  deal  with  one  interpretation  of  the  directive 
case.     Consider  the  following  sentences: 

John  arrived  at  the  station. 

John  entered  the  station. 

John  left  the  station. 
The  verbs  in  the  first  2  sentences  cause  the  location  "station"  to  be 
thought  of  as  a  "destination", 'while  the  verb  "leave"  makes  the  location 
appear  to  be  the  "source"  of  travel.     Hence,    depending  on  the  particular 
verb,  the  source  and  destination  of  the  directive  case  may  need  to  be 
interchanged. 

In  NLP,   as  has  been  discussed,   the  preposition  associated  with  a 

location  is  considered  to  be  an  important  part  of  the  concept,    so  a 
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separate  record  is  utilized  to  hold  the  preposition.     In  a  record  that 
has  a  location  associated  with  it,   a  LOCATION  attribute  points  to  a 
record  which  has  a  SUP  of  the  specific  preposition  involved,   and  a 
LOCOBJ  attribute  pointing  to  the  actual  location. 

hi  rule  9,   then,   the  condition  that  it  be  applied  is  that  the 
ACTRECC  must  have  a  LOCATION  attribute  and  a  SUP  of  either 
'ARRIV  or  'ENTER'.     If  rule  9  is  applicable,   it  creates  a  'REL9' 
with  the  required  attributes  ACT,    PPSRC,   and  PPDES.     In  this  case, 
the  location  is  considered  to  be  the  destination,    so  PPDES  is  made  to 
point  to  the  location,  which  is  the  value  of  the  LOCOBJ  attribute  of 
the  record  pointed  to  by  LOCATION.     PPSRC  is  given  the  value  of  the 
SOURCE  attribute  of  the  action  record..    Rule  12  is  similar  to  this,   but 
the  source  and  destination  are  interchanged.     PPSRC  is  set  to  the  loca- 
tion,  while  PPDES  is  set.  to  the  DESTIN  of  the  action  record  being 
processed. 

Rules  10  and  11  handle  the  other  interpretation  of  the  directive 
case--that  in  which  the  concept  of  "motion"  is  involved. 

Rules  15  and  16  process  the  instrumental  case,   relation  8. 
These  rules  contain  some  creation  elements  that  have  not  been  encoun- 
tered before,   and  are  best  explained  by  example.     Consider  the  sentence 

John  broke  the  window  by  throwing  a  rock. 

and  the  two  ACTRECCs  (with  just  the  pertinent  information)  associated 

with  it: 

Rl     ('BREAK',    AGENT  =  'JOHN',    GOAL  =  'WINDOW,   INST^R2') 
R2      ('THROW,    AGENT-'JOIIN',    GOAL='ROCK') 
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Assume  for  the  moment  that  R2  comes  off  the  stack  first  (which  would 

be  the  case  if  the  order  of  the  original  action  records  had  been  reversed 

in  the  'ACTNLIST').     When  it  is  processed  by  rule  13  for  a  'REL1',   a 

ONEPTR  attribute  in  R2  would  be  set  to  point  to  the  'REL1'  record 

being  created,    say  R3,  which  now  represents  the  instrument  of  the 

conceptualization  being  considered.     R2  would  then  be  processed  by 

rule  14,   and  then  deleted  from  the  processing  list  (but  not  from  the 

system). 

When  Rl  comes  off,    it  also  is  processed  by  rules  13  and  14, 

and  then  rule  15  would  be  applied.     A  RELREC  would  be  created,    say 

R4,   with  a  SUP  of  'REL8'  and  an  ACT  attribute  pointing  to  the  ENTY 

of  Rl.     Then  an  INSTREL  (for  INSTrumental  RELation)  would  be  set 

to  point  to  R3,   which  is  the  value  of  the  ONEPTR  attribute  of  the 

record  pointed  to  by  INST  of  Rl.     That  is,    in  the  creation  specification 

INSTREL=ONEPTR(INST(ACTRECC)) 

INST(ACTRECC)  is  R2,   and  the  ONEPTR  of  R2  points  to  R3.     The   next 

element,    "-JNSTREL"  (called  a  "condition  on  the  right"),    says  that  if, 

at  this  point,   there  is  no  INSTREL  attribute  in  the  record,   then  execute 

the  elements  that  follow.     Otherwise,   cease  execution  of  this  rule  and 

go  on  with  the  processing.     Since  there  is  an  INSTREL  attribute  in  this 

record,   the  rest  of  the  rule  would  be  ignored  and  processing  would 

continue  with  R4  being  added  to  'RELLIST'.     It  has  all  the  required 

attributes  at  this  point: 

SUP  =  'REL8' 

ACT  =  (pointer  to  ENTY  of  Rl) 

INSTREL  =  'R3' 
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This  expresses  the  instrumental  relation  between  the  ACT  "break" 
and  the  instrument  R3,    "John  throw  rock". 

But  now,   going  back  to  the  initial  stack,    suppose  Rl  comes 
off  first  for  processing.     It  will  be  processed  by  rules  13,    14,   and  15. 
Rule  15  would  create  a  RELREC,    say  R  5,   with  a  SUP  of  'RELS'  and 
the  appropriate  ACT  attribute.     Then  the  next  creation  element  would 
set  the  INSTREL  attribute  to  the  ONEPTR  of  R2.     However,    since  R2 
has  not  yet  been  processed  b}r  rule  13,   its  ONEPTR  attribute  is  zero, 
so  the  INSTREL  is  zero.     This  time,   when  the  condition  "-.INSTREL" 
is  tested,   it  yields  a  value  of  "true"  because,    at  this  point,   the 
INSTREL  attribute  is  not  set.     Consequently,   the  next  creation  element 
is  executed.     It  sa}^s  to  set  a  BACK8    attribute  in  R2  to  point  to  this 
RELREC,    so  now  R2  is  as  follows: 

R2  ('THROW,    AGENT  =  'JOHN',    GOAL-'ROCK',    BACK8  =  'R5') 
R5  would  be  placed  on  'RELLIST',    but  would  be  incomplete  at 
this  point- -it  still  needs  an  INSTREL  attribute.     Rl  would  be  deleted  from 
the  processing  list,   and  R2  would  come  off  the  stack.     After  being  proc- 
essed by  rules  13  and  14,   R2  would  still  have  its  BACK8  attribute  plus 
a  ONEPTR  (from  rule  13)  pointing  to  the  'REL1'  created  by  rule  13. 
This  'REL1'  represents  the  instrument  of  Rl.     R2  would  be  picked  up  by 
rule  16  since  it  does  have  a  BACK8  attribute.     This  rule  would  set  the 
INSTREL  attribute  of  R5  equal  to  the  ONEPTR  of  R2  (which  is  pointing 
to  the  "instrument"),    and  then  erase  the  BACK8  attribute  and  place 
this  ACTRECC(R2)  back  on  the  stack  for  processing.     But  now  the 
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conceptualization  is  complete  in  the  relation  structure  because  the 
required  INSTREL  attribute  has  been  set  in  the  previously  created 
RELREC.     So  by  the  use  of  ONEPTR  and  the  "back-pointer"  BACK8, 
the  required  attribute  has  been  "back-stuffed"  into  the  relation  record. 

Rules  17  and  18  create  the  relation  record  for  "causality  "-- 
relation  10,    and  rules  20  and  21  handle  relation  12--that  of  "concur- 
rence".    Since  pointers  to  other  relations  are  involved,  the  same 
problems  arise  as  were  discussed  for  rules  15  and  16.     Consequently, 
the  processing  is  done  in  an  identical  manner  as  that  in  rules  15  and 
16.     The  only  difference  is  that  here  there  are  two  'RELl's   involved 
instead  of  one  'REL1'  and  an  ACT. 

Rule  19  builds  a  'REL11',   which  represents  the  time  of  a 
conceptualization,   and  rule  22  yields  a  'REL13',   which  gives  the 
location  of  a  conceptualization. 

The  rest  of  the  rules  in  Fig.    5  are  used  to  process  entity 
records.     Rules  24-28  all  create  'REL4's,    but  are  separated  because 
of  the  way  NLP  handles  this  type  of  attribute.     The  manner  in  which 
these  rules  are  applied  was  described  in  the  example  at  the  beginning 
of  this  section.     The  only  thing  in  these  five  rules  that  has  not  been 
mentioned  before  is  the  condition  specification  of  rule  24,    SIZE$'RELSIZ' 
This  specifies  that  the  SIZE  attribute  being  considered  must  be  in  the 
set  ($)  'RELSIZ'  ("relative"  size,   as  opposed  to  "absolute"  size). 

The  final  three  rules  in  this  group  (29-31)  deal  with  the  three 
parts  of  conceptual  relation  5. 
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Rule  29  is  concerned  with  location,   as  was  rule  22,   but  here 
it  is  the  location  of  an  entity,  rather  than  the  location  of  a  conceptuali- 
zation,  that  is  being  processed.     Again,   the  location  preposition  is 
prominent,     hi  both  rules  30  and  31,   the  preposition  is  ignored  because 
the  meaning  is  clear. 

Before  concluding  this  section,   one  further  remark  should 
be  made.     It  may  appear  that  rules  7-12  are  out  of  order,   but  this  is 
not  the  case.     They  each  require  information  about  some  attribute  in 
the  action  record  that  gets  erased  in  later  processing.     For  example, 
rules  7  and  8  both  require  the  AGENT  attribute,    but  if  rule  13  had 
processed  the  record  first,   there  would  no  longer  be  an  AGENT  attri- 
bute. 

C.         RELATION-TO- ATTRIBUTE  PROCESSING 

The  procedure  for  converting  information  from  the  relation 
structure  form  to  the  entity -attribute -value  form  is  much  simpler  than 
the  inverse  procedure  just  discussed.     There,   because  of  the  uncer- 
tainty of  the  content  of  the  input,    a  rather  complicated  process  was 
required  to  obtain  all  the  pertinent  information  for  the  construction  of 
the  relation  records.     Since  the  relation  records  have  a  precisely 
defined  content,   it  is  a  fairly  straightforward  process  to  set  the  appro- 
priate attributes  in  the  action  and  entity  records.     It  must  be  stressed, 
though,   that  only  simple  cases  have  been  considered  in  this  report. 

The  rules  for  this  processing  are  shown  in  Pig.   6.     The  proces- 
sing of  records  here  follows  the  same  general  pattern  as  discussed 
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earlier:    the  records  are  placed  on  a  stack,   then  one  at  a  time  they 

are-removed.     The  rules  are  searched  to  find  the  one  that  is  applicable 

to  the  current  record,   and  the  creation  specifications  of  that  rule  are 

executed.     The  difference  at  this  point  is  that  once  the  rule  has  been 

applied,   the  record  being  processed  is  no  longer  needed,    so  does  not 

go  back  on  the  stack.     This  means  that  each  REL  record  is  processed 

only  once. 

1.      Rule  Processing  Example 

To  illustrate  this  processing,   the  relation  structure  for  the 

conceptualization  used  in  the  previous  section  will  be  processed 

(referring  to  Fig.    6).     The  records  representing  the  structure  are: 

RELLIST  (@11  =  ,R1',@12  =  'R2,,@13  =  'R3',  LASTREC  =  13) 

Rl  ('REL4',    PP='D6',    PA='D7') 

R2  ('REL6',    ACT  =  'D4',    PPOBJ='D6') 

R3  ('REL1',    ACT  =  'D4\    PP  =  'D5',    PAST) 

D4  ('EAT') 

D5  ('JOHN') 

D6  ('APPLE') 

D7  ('RED') 

(The  record  designations  have  been  changed,   but  the  records  themselves 
are  identical  to  the  previous  ones.) 

Rule  34  puts  a  copy  of  'RELLIST'  on  the  stack  and  sets  LC 

to  11: 

51  RECLISTD(@11  =  'R1',@12  =  ,R2,,@13  =  'R3',    LASTRE013, 

LC=11) 

SI  comes  off  the  stack  and  rule  35  puts  a  RELRECC  on  the  stack  (a 
copy  of  @11  of  RECLISTD)  along  with  SI,   which  now  has  LC  =  12: 

52  RELRECCCREL4',    PP='D6\    PA='D7') 

SI      RECLISTD(@11  =  'R1',@12  =  'R2',@13  =  'R3',    LASTREC  =  13, 
LC  =  12) 
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When  the  top  record  (S2)  comes  off  the  stack,  the  rules  are 
searched  for  one  in  which  the  condition  specifications  are  satisfied, 
and  rule  40  is  found  to  apply,    so  the  right  side  of  the  rule  is  executed. 
The  rather  complicated  expression  in  rule  40  bears  some  explanation, 
but  first  a  word  about  its  purpose. 

A  'REL4'  expresses  the  relationship  between  a  PP  and  a  PA. 

Currently,   in  NLP  all  PAs  are  realized  as  adjectives  and,    as  such, 

they  all  represent  attributes  such  as  size,   color,   quantity,    etc.     The 

value  of  an  ATTRIB  attribute  is  the  name  of  one  of  these  attributes. 

For  example,   consider  the  named  record  definitions 

RED('ABSCOLR') 

ABSCOLRCQUALVAL',    ATTRIB='COLOR ') 

These  say  that  'RED'  is  in  the  set  'ABSCOLR',   and  'ABSCOLR'  is  in 

the  set  'QUALVAL'  with  an  ATTRIB  attribute  of  'COLOR'. 

It  can  be  seen,   then,   that  given  a  concept  such  as  "red",   the 

type  of  attribute  it  represents  may  be  determined  by  searching  its  SUP 

chain  until  an  ATTRIB  is  found,    and  the  value  of  ATTRIB  will  be  the 

name  of  the  attribute  that  the  concept  represents.     The  first  part  of 

the  right  side  of  rule  40  does  precisely  this.     It  says  to  take  the  PA  of 

the  current  RELRECC,    get  its  SUP,   and  search  the  SUP  chain  ($)  of 

that  SUP  until  an  ATTRIB  attribute  is  found.     Then  cause  this  attribute 

(@)  of  the  PP  of  RELRECC  to  point  to  the  SUP  of  the  PA  of  RELRECC. 

Applying  this  rule  to  S2,   the  processing  proceeds  as  follows:    The  PA 

of  S2  points  to  D7;  the  SUP  of  D7  is  'RED';  searching  the  SUP  chain  of 

'RED'  until  an  ATTRIB  attribute  is  found  yields  'COLOR-';  set  the 
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COLOR  attribute  of  D6  to  the  SUP  of  D7.     Record  D6  now  looks  like 
this: 

D6  ('APPLE',    COLOR  =  'RED') 
and  the  stack  has  been  reduced  to  SI. 

Again,   rule  35  is  applied,  yielding 

53  RELRECCCREL6',    ACT='D4',    PPOBJ='D6') 

SI      RECLISTD(@11  =  'R1',  @12  =  'R2  ',  @13  =  'R3',  LASTREC  =  13,  LC  =  13) 

S3  is  a  'REL6',    so  rule  44  applies.     The  GOAL  attribute  of 
the  ACT  specified  in  S3  is  set  to  the  PPOBJ  of  S3,   which  makes  the 
GOAL  attribute  of  D4  equal  to  'D6': 

D4  ('EAT',   GOAL='D6') 

Rule  35  is  applied  once  more,   putting  a  copy  of  R3  on  the 
stack  and  incrementing  LC  of  SI  to  14: 

54  RELRECCCREL1',    ACT  =  'D4\    PP='D5',    PAST) 

SI      RECLISTD(@11  =  'R1',  @12  =  'R2',  @13  =  'R3',  LASTREC  =  13,  LC  =  14) 

Record  S4  is  taken  from  the  stack,    and  rule  37  causes  the 
AGENT  of  D4  to  point  to  'D5',   then  sets  the  VERBPHIND  of  D4  to  "PAST" 
D4     ('EAT',   GOAL  =  'D6',    AGENT  =  'D5\    PAST) 

Finally,   SI  comes  off  the  stack  and  can  no  longer  be  processed 
by  rule  35  because  LC  is  now  greater  than  LASTREC,    so  rule  36  is 
applied,   leaving  the  stack  empty  and  completing  the  processing. 

The  action  record  (D4)  and  those  records  that  it  references 
(D5  and  D6)  make  up  the  IDS  representation  of  the  conceptualization 
"John  ate  a  red  apple.  " 
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2.       Explanation  of  Remaining  Rules 

The  rest  of  this  section  describes  the  remainder  of  the  rules 
shown  in  Fig.   6. 

Rule  38  processes  a  'REL2',   which  represents  the  second 
dependency  relation  in  Fig.   3.     The  English  realization  of  this  relation 
is  either  a  sentence  (e.g.,    "John  is  big.  ")  or  a  relative  clause  ("John, 
who  is  big,    .    .    .   "),   so  rule  2  handles  both  possibilities.     A  record  is 
created  with  a  SUP  of  'BE',    and  attributes  of  SUBJECT,    PREDADJ, 
and  VERBPHIND,    and  the  RELCL  (relative  clause)  attribute  of  the  PP 
is  made  to  point  to  this  record.     Next,   a  check  is  made  on  SENT(MEM), 
an  attribute  that  contains  a  pointer  to  a  record  which  contains  informa- 
tion for  the  main  clause  of  a  sentence.     In  this  case,   if  SENT(MEM) 
has  a  value,   then  the  PP  of  RELRECC  is  already  designated  to  be  put 
out  as  part  of  a  sentence,    so  the  record  just  created  will  be  processed 
as  a  relative  clause  modifying  that  PP.     Otherwise,   the  RELCL  attri- 
bute of  the  PP  will  be  cleared  and  this  record  will  be  put  out  as  a 
sentence  by  itself  ("John  is  big.  "). 

Rule  39  does  the  same  processing  for  a  'REL3 '--"John  is  a 
doctor."  or  "John,  who  is  a  doctor,    .    .    .    ". 

Rule  41  sets  the  LOCATION  attribute  of  the  PP  to  point  to 
the  preposition  record,  and  the  LOCOBJ  attribute  of  the  preposition 
record  to  the  actual  location. 

In  creating  the  'REL5B'  relation  record  from  the  entity 
record  (rule  30),   the  preposition  was  ignored  because,    conceptually, 
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it  is  understood.     However,  the  preposition  is  needed  to  complete  the 
representation  in  the  IDS,    so  it  must  be  supplied.     Rule  42  does  this 
by  creating  a  record  with  a  SUP  of  'IN'  and  a  LOCOBJ  attribute  point- 
ing to  the  "container".     Then  the  LOCATION  attribute  of  the  main  PP 
is  made  to  point  to  this  new  record. 

Rule  43  sets  the  OWNER  attribute  of  the  main  PP  equal  to 
the  PPOWN  attribute  of  the  'REL5C1,   which  points  to  the  "possessor" 
of  the  main  PP. 

Rules  45-47  process  the  'REL7'  relation,   or  recipient  case. 
The  option  is  given  here  of  either  changing  the  verb  to  'GIV  or   'TAK' 
(whichever  is  appropriate)  or  leaving  it  as  'TRANS'.     The  option  is 
determined  by  the  setting  of  the  indicator  VERBSW(MEM).     If  it  is  set, 
either  'GIV  or   'TAK'  will  be  the  action  in  the  action  record.     Other- 
wise,  the  action  will  be  'TRANS'. 

In  rules  48  and  50-53,   each  of  the  relation  records  references 
another  relation,    but  all  of  these  are  realized  in  the  entity -attribute- 
value  structure  as  attributes  of  an  action  record.     Hence,   these  rules 
reference  the  ACT  of  the  relations. 

Rules  49  and  54  simply  set  attributes  in  the  appropriate 
records.  The  final  two  rules,  55  and  56,  dispose  of  RECORD  and 
NULL  records. 

D.         EXAMPLES  OF  RULE  APPLICATION 

This  section  presents  examples  of  conceptualizations  that  were 
processed  by  the  rules  described  in  the  two  preceding  sections.     The 
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rules  were  compiled  and  executed  by  NLP  on  the  CP/CMS  time-sharing 
system  of  the  IBM  360/67.     The  conceptualizations  were  input  to  the 
system  as  named  record  definitions  in  the  form  that  would  be  available 
to  the  rules  if  NLP  had  decoded  the  sentences. 

For  example,   the  data  presented  to  the  ATTRIBUTE-TO-RELATION 
rules  was  in  the  attribute -value  form  of  NLP,   while  that  given  to  the 
RELATION-TO- ATTRIBUTE  rules  was  in  the  format  defined  here  for 
the  dependency  relation  structure. 

Figures  7  and  8  show  the  input  to  the  rules  and  the  output  obtained 
for  the  conceptualization 

"John  gave  a  big  red  book  to  the  little  girl.  " 
The  dependency  relation  network  for  this  conceptualization  is 


p  o  ti 

John<^=^>  giv  < book  <; — 


/K     /\ 


^-  little 
John 


big  red 
(Note  in  Fig.    7.,   that  after  processing,    'Rl'  has  a  SUP  attribute  of 
'TRANS'.     Also,   in  Fig.   8.,   if  VERBSW(MEM)  had  not  been  set,     'D2' 
would  have  a  SUP  of  'TRANS'.  ) 

Figure  9  and  10  show  the  input  and  output  for 

"John  hit  Mary  on  the  swing  in  the  park  yesterday  while 
the  dog  barked" 
and  its  conceptual  network. 
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yesterday  park 

John     <-£->        hit  <r^—  Mary 


on 


dog       <t±L^>     bark  swing 

The  records  in  the  output  of  each  example  that  are  labeled  "C-" 
are  records  that  were  created  during  processing  by  the  rules.     The 
"C"  designations  are  being  used  here  to  represent  the  actual  numerical 
values  given  to  the  records  by  the  system. 
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INPLT 


PLUR   15 


INDICATORS: 

VERBPHIND   13-33 

PAST   13, 
VERBSW   49 

ATTRIBUTES: 

SUP    1 


NAMED    RECORDS: 

RELLIST     (LASTREC=10) 
■     ACTNLIST    (ail=eRl5,    LASTREC=11) 
MOBL 1ST  ?=  »  r  a • 

STALIST 


{ 


{ 


Rl 
R2 

R3     ( 
R4    ( 

BIG 
RED 
LITTLE 

RELSIZ 


«GI  V 
« BOCK" 
•GI RL' 
4  JOHN « 
(! RELS 
(  ' 


{  a  1 1  =  <  R  2  ' 


,     LASTREC^ll) 
AGENT='R4«,    GOAL='R2«s     RECIP=«R3» 
,     COLCR=,RED,t     SIZE='BIG«) 
t     SI  ZE  =  '  LITTLE'  ) 
) 

IZ«  ) 
ABSCOLR' ) 
(  'RELSIZ* ) 
CRELVAL'  ,     ATTRIB  =  6S  IZE«  ) 


PAST) 


ABSCOLR  {'QUALVAL',  ATTR I B = « COLOR' ) 


OUTPUT 


RELLI  ST 


CI 
C2 
C3 
C4 
C5 
C6 
C7 
C8 
C9 
Rl 
R2 
R3 
R4 


(«  REL7« 

(  'REL1'  » 

( 'REL6'  , 

I  «REL4'  , 

'REL4' , 

'REL4'  , 

' LITTLE 

'BIG' ) 

'RED' ) 

'TRANS' 

'BOOK' ) 

'GIRL'  ) 

'JOHN' ) 


Ol  1  =  "  CI'  , 
ai5=,C5«» 


ACT  =  «  Rl 
ACT='R1 
ACT='R1 
PP='R3' 
p  p  =  » p  2  * 
PP  =  «R2« 
) 


312  =  *  C2«  , 
ai6=«C6' , 
PPDON=« 
PP=« R4' 


PPOBJ= 
PA=«C7  ' 
PA='C8« 
PA= 'C9« 


,  ai4=«C4« 
LASTREC=16) 

R4« ,  PPREC=  !R3'  ) 
,  PAST) 
R2e  ) 


EXAMPLE  OF  ATTRIBUTE-TO-RELATION  FRCCESSING 
"JOHN  GAVE  A  BIG  RED  BOOK  TO  THE  LITTLE  GIRL." 

FIGURE  7. 
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INPUT 


INDICATORS: 

VERBPHIND   13-33 

PAST   13,    PLUR 
VERBSW   49 

ATTRIBUTES: 

SUP    1 


15 


NAMED  RECORDS: 

RELLIST  .(311=« Rl» , 


Rl 
R2 
R3 
R4 
R5 
R6 
Dl 
D2 
D3 
D4 
D5 
D6 
D7 
BIG 


3 1 5=  •  R  5  «  , 
PP  = 


312  =  9R2S  313=«R3 


'REL1« 
•REL6' 

•RCL7« 

«REL4« 
i 


) 


) 


•REL4 

•REL4' 
{«  JCHN« 
( "TRANS 
{ «BOOK» 
C 'BIG' ) 
( 'RED') 
( 'GIRL'  ) 
('  LITTLE* 

( 'RELSIZ 


8D1 
ACT=*D2 
ACT=«D2 
PP  =  »D3 8 
PP  =  • D3  c 
PP=  tD6e 


316=eR6« ,  LASTREC=16 


ACT=,D2S  PASTJ 

PP0BJ=«D3« ) 

PPDON=8Dl«  t 

) 

) 

) 


♦  £14='R48, 
VERBSW (MEM) ) 


PA=  «D4« 
PA=« D5« 
PA  =  «D711 


PPREC=»D6« ) 


) 


LITTLE  J«RELSIZ»: 
RED  {'A3SCOLR8) 
RELSTZ  ("RELVAL", 
ABSCGLR  PQUALVAl 


ATTRIB=' SIZE1 ) 

,  ATTRIB='COLOR*) 


OUTPUT 


Dl  (  •JOHN1 

D2  ('GIV«, 

D3  ('BOCK' 

D6  {  'GIRL' 


AGENT=,D1',  GOAL=8D3« 
SIZE='BIG«,  CCLOR=« 
SIZE=« LITTLE8 ) 


,   RECIP  = 

RED8  ) 


■D6« ,PAST) 


EXAMPLE  OF  RELATION-TO  ATTRIBUTE  PROCESSING 
"JOHN  GAVE  A  BIG  RED  BOCK  TO  THE  LITTLE  GIRL." 

FIGURE  8. 
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INPUT 


INDICATORS: 

VERBPHIND       13-33 

PAST       13,  PLUR       15 

VERBSW       49 

ATTRIBUTES: 

SUP    1 


NAMED 


RECORDS: 
ACTNLIST 
MOBLIST 
STALIST 
R 1  ('HIT 


(311=' 

(aii=«R 
can  =• 


Rl 


,  ai2=»R2' 
312=»R4«, 
312=' R8B , 


,  LASTREC=12) 
LASTREC=12) 
LASTREC=12) 


R2 
R3 
R4 
R5 
R6 
R7 
R8 
R9 
RIO 


agent='r: 

TIME=' R9 


•BARK« 
"JOHN' 
• MARYE 
•ON8  , 
•SWING 
•IN'  , 
•PARK* 
•YESTERDAY 
(«DCG«  ) 


AGENT=«R10« ,     PAST) 


G0AL='R4«,    CONCURR=«R28 
LOCATICN= 'R71 ,     PAST) 


,     LOCATICN=«R5 

LOCOE^'Re1 ) 

•  ) 

LGCOBJ=*  R8«  ) 

) 

) 


) 


OUTPUT 


RELLIST  (311= 
315=' 
•REL1* 
•REL6e 
'REL11 
•REL12 
•REL13 


CI 
C2 
C3 
C4 
C5 


«C1«  *     D12  =  «C2?  ,    ai3='C3« 
C5%     ai6=«C6e,     ai7=«C7e, 

ACT  =  '  Rl'  ,     PP  =  »  R3«  ,     PAST) 


ACT=«R1 


PPCBJ=,R4' ) 


314=« C4« , 
LASTREC=17) 


TIME=«R9',     MAINREL=«C1S  ) 


C6 
C7 

Rl 
R2 
R3 
R4 
R5 
R6 
R7 
R8 
R9 
RIO 


*REL1S 
■REL5A 

•HIT') 
•BARK'  ) 
•JOHN"  ) 
•MARY' ) 
•OK"  ) 
•SWING  •  ) 
•IN'  ) 
«  PARK'  ) 
•YESTERDAY 
(' COG8  ) 


MAINREL=»C1 
MAINREL='C1 
PPLCC  =  '  R8'  ) 
ACT='R2' t  PP 
LCCPREP=SR5 
PPMAI N  =  8  R4« 


DEPREL=«C6« ) 
L0CPREP=»R7* » 


RIO1 


PAST) 


PPL0C=»R6« 


EXAMPLE    OF    ATTRIBUTE-TO-RELATION    PROCESSING 

"JOHN    HIT    MARY    ON    THE     SWING    IN 
THE    PARK    YESTERDAY    WHILE    THE    DOG    BARKED." 


FIGURE    9. 
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INPUT 


INDICATORS: 

VERBPHIND   13-33 

PAST   13,    PLUR   15 
VERBSW   49 


ATTRIBUTES: 

SUP    1 

NAMED  RECORDS: 

RELLIST  (211=eRl5 

?J  1  5  =  «  R  5  • 


Rl 
R2 
R3 
R4 
R5 
R6 
R7 
Dl 
D2 
D3 
D4 
D5 
D6 
D7 
D8 
D9 
DIO 


(  'REL1'  t 

(«  REL6'  , 

( «REL5A« 

•REL13' 

•RELll* 

■REL12' 


PP= 'Dl« 


ai2=,R2S  ai3=sR3« 
316='  R66  ,  ai7  =  «  R78 
ACT='D28,  PAST) 
ACT=»  C2«  ,  PPCBJ='C3 
PPMAIN='D8'  , 
MAINREL=,R1« 
MAI NREL=' Rl8 
MAINREL='R1 


RELI» 
JCHN' 
HIT'  ) 
CN'  ) 

DOG'  ) 
BARK' 
PARK' 


PP='D48,  ACT='D5«,  PAST) 


aiA=«R4' , 
LASTREC=17) 


PPLOC=«D9«,  LOCPREP=« D3 ' ) 
L0CPREP=»D1Q« ,  FPL0C='D6« 
TIME=»  D71  ) 
DEPREL='  R7«  ) 


YESTERDAY' ) 
MARY' ) 
SWING1 ) 
8  IN'  ) 


OUTPUT 


Dl 
D2 

D3 
D4 
D5 
D6 
D7 
D8 
D9 


•JOHN' 
•HIT' , 

«  0 1\  •  , 
'DOG' ) 

'BARK' 
•  PARK8 


AGENT='D1',     LCCATICN='D10e ,     GCAL=8 


C0NCURR='D5' 
LOCOBJ=  8C9  8 ) 


TIME='D7' ,    PAST) 


) 


AGENT='D4',    PAST) 


' YESTERDAY' ) 
'MARY8,     LOCATION; 
' SWING1 ) 


«D3«  ) 


DIO  (  'IN8  ,  LOCOBJ  =  'D68 ) 


EXAMPLE  OF  RE LAT  I  CN-TO-ATTRI BUT E  PROCESSING 

"JOHN  HIT  MARY  ON  THE  SWING  IN 
THE  PARK  YESTERDAY  WHILE  THE  DOG  BARKED." 


FIGURE  10. 
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V.     CONCLUSIONS 

The  objective  of  this  research,    as  stated  in  the  INTRODUCTION, 
has  been  met.     First,    a  means  was  devised  for  representing,   within 
the  framework  of  NLP,   the  conceptual  structure  defined  by  Conceptual 
Dependency  theory.     Then  two  sets  of  rules,  written  in  the  rule  language 
of  NLP,  were  developed.     The  first  set  converts  the  attribute -value 
representation  of  information  to  dependency  relation  form,   and  the 
second  set  converts  the  dependency  relation  representation  of  informa- 
tion to  attribute -value  form. 

Most  of  the  information  required  by  the  conceptual  structure  is 
available  in  the  existing  structure  of  NLP,   and  that  which  is  not  currently 
available  could  be  made  available  by  making  some  minor  modifications  to 
NLP's  present  set  of  English  processing  rules. 

It  is  recommended  that,   if  further  use  is  to  be  made  in  NLP  of  the 
conceptual  dependency  structure,    consideration  be  given  to  modifying 
the  processing  rules  of  NLP  so  that  they  will  produce  the  conceptual 
structure  directly,   rather  than  generating  the  present  form  of  represen- 
tation and  then  converting  that  to  the  conceptual  structure. 
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the  structures  of  Conceptual  Dependency  theory  in  NLP,    and  to  develop 
methods  for  conversion  of  information  between  NLP's  existing  repre- 
sentation and  that  of  Conceptual  Dependency  theory. 
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